Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
馃幆 Reinforcement Learning
Q-learning, Policy Gradient, Reward Functions, TD Learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
82894
posts in
663.8
ms
Reinforcement
Learning from Human
Feedback
arxiv.org
路
16h
馃幆
Predictive Coding
Hybrid neural鈥揷ognitive models reveal how memory
shapes
human
reward
learning
nature.com
路
21h
馃幆
Predictive Coding
Quantization-Aware
Distillation
ternarysearch.blogspot.com
路
4h
路
Discuss:
Hacker News
馃攧
Meta-Learning
On
Computation
and
Reinforcement
Learning
arxiv.org
路
2d
馃幆
Predictive Coding
Hybrid Model鈥態ased / Model鈥慒ree Reinforcement Learning for Energy鈥慐fficient Autonomous Warehouse Robot Navigation with Real鈥慣ime
Obstacle
Prediction **
Abstra
...
freederia.com
路
1d
馃
Robotics
Learning Models with Uniform Performance via
Distributionally
RobustOptimization
dev.to
路
18h
路
Discuss:
DEV
馃幆
Predictive Coding
Deep reinforcement learning-based energy scheduling for green buildings with
stationary
and EV batteries of heterogeneous
characteristics
sciencedirect.com
路
1d
馃
Neuromorphic Computing
Continual
learning and the post
monolith
AI era
baseten.co
路
1d
路
Discuss:
Hacker News
馃
Neuromorphic Hardware
Part 5: Reward Engineering: How to Shape
Behaviors
in
Financial/Robotic
Tasks
dev.to
路
2d
路
Discuss:
DEV
馃幆
Predictive Coding
Why
reinforcement
learning breaks at scale, and how a new method
fixes
it
techxplore.com
路
3d
馃
Neuromorphic Hardware
Performance
Tip
of the Week #94: Decision making in a
data-imperfect
world
abseil.io
路
8h
馃幆
Predictive Coding
i10e-lab/HelloRL
: A fully modular framework to make Reinforcement Learning quick and easy
github.com
路
1d
路
Discuss:
Hacker News
馃攧
Meta-Learning
Physics-Informed Neural Networks for
Inverse
PDE
Problems
pub.towardsai.net
路
15h
馃
Machine Learning
Personalized Adaptive Feedback System for Early Detection and Intervention of Fine鈥慚otor Skill Development in
Preschool
Children Using Wearable
IMU
Sensors and Reinforcement Learning
freederia.com
路
2d
馃攧
Meta-Learning
Hypernetworks
: Neural Networks for
Hierarchical
Data
blog.sturdystatistics.com
路
2d
路
Discuss:
Hacker News
馃幆
Predictive Coding
Tips
lonestation.itch.io
路
12h
馃М
Algorithms
Barn
Owls
Know When to Wait (
iuSTDP
part 2)
blog.typeobject.com
路
9h
路
Discuss:
Hacker News
馃
Neuromorphic Hardware
On
Economics
of A(S)I Agents
lesswrong.com
路
11h
馃
Neuromorphic Hardware
learning by
reverse
engineering
clymup.com
路
16h
馃攧
Meta-Learning
Exploiting
large language model with reinforcement learning for generative job
recommendations
eurekalert.org
路
2d
馃幆
Predictive Coding
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help